Phonetically aided syntactic parsing of spoken language
نویسندگان
چکیده
The paper presents a technique for parsing a speech utterance from its phonetic representation. The technique is different from a conventional spoken language parsing techniques where a speech utterance is first transcribed at word-level and a syntactic structure is produced from the transcribed words. In a word-level parsing approach, an error caused by a speech recognizer propagates through the parser into the resultant syntactic structure. Furthermore, sometimes transcribed speech utterances are not parse-able even though lattices or confusion networks are used. These problems are addressed by the proposed phonetically aided parser. In the phonetically aided parsing approach, the parsing is performed from a phonetic representation (phone sequence) of the recognized utterance using a joint modeling of probabilistic context free grammars and a n-gram language model. The technique results in better parsing accuracy then word-level parsing when evaluated on spoken dialog parsing task in this paper.
منابع مشابه
An improved joint model: POS tagging and dependency parsing
Dependency parsing is a way of syntactic parsing and a natural language that automatically analyzes the dependency structure of sentences, and the input for each sentence creates a dependency graph. Part-Of-Speech (POS) tagging is a prerequisite for dependency parsing. Generally, dependency parsers do the POS tagging task along with dependency parsing in a pipeline mode. Unfortunately, in pipel...
متن کاملبرچسبزنی خودکار نقشهای معنایی در جملات فارسی به کمک درختهای وابستگی
Automatic identification of words with semantic roles (such as Agent, Patient, Source, etc.) in sentences and attaching correct semantic roles to them, may lead to improvement in many natural language processing tasks including information extraction, question answering, text summarization and machine translation. Semantic role labeling systems usually take advantage of syntactic parsing and th...
متن کاملParsing Transcribed Spoken Language
This paper investigates some of the challenges that arise when parsing transcribed spoken language, as opposed to parsing written language. In particular, the paper has focus on identifying clauses as the object of syntactic analysis, and parsing data containg production errors.
متن کاملShallow Parsing of Spoken Estonian Using Constraint Grammar
In this paper we describe how we have adapted the syntactic analyzer of written Estonian to the spoken language. The Constraint Grammar shallow syntactic parser (Müürisep et al. 2003) was used for the automatic syntactic analysis of the corpus of Estonian spoken language (Hennoste et al. 2000). To adapt the parser, the clause boundary detection rules as well as some syntactic constraints had to...
متن کاملAdapting dependency parsing to spontaneous speech for open domain spoken language understanding
Parsing human-human conversations consists in automatically enriching text transcription with semantic structure information. We use in this paper a FrameNet-based approach to semantics that, without needing a full semantic parse of a message, goes further than a simple flat translation of a message into basic concepts. FrameNet-based semantic parsing may follow a syntactic parsing step, howeve...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2012